Tianxin Wei

Adaptive Auto-Harness: Sustained Self-Improvement for Agentic System Deployment on Open-Ended Task Streams

Jun 01, 2026
Via arXiv

Harness Updating Is Not Harness Benefit: Disentangling Evolution Capabilities in Self-Evolving LLM Agents

May 28, 2026
Via arXiv

Code as Agent Harness

May 18, 2026
Via arXiv

PAPERMIND: Benchmarking Agentic Reasoning and Critique over Scientific Papers in Multimodal LLMs

Apr 23, 2026
Via arXiv

ReMix: Reinforcement routing for mixtures of LoRAs in LLM finetuning

Mar 10, 2026
Via arXiv

MC-Search: Evaluating and Enhancing Multimodal Agentic Search with Structured Long Reasoning Chains

Mar 01, 2026
Via arXiv

FeDecider: An LLM-Based Framework for Federated Cross-Domain Recommendation

Feb 17, 2026
Via arXiv

TSAQA: Time Series Analysis Question And Answering Benchmark

Jan 30, 2026
Via arXiv

Agentic Reasoning for Large Language Models

Jan 18, 2026
Via arXiv

CoReflect: Conversational Evaluation via Co-Evolutionary Simulation and Reflective Rubric Refinement

Jan 18, 2026
Via arXiv